智能论文笔记

Promotheus: An End-to-End Machine Learning Framework for Optimizing Markdown in Online Fashion E-commerce

Eleanor Loh , Jalaj Khandelwal , Brian Regan , Duncan A. Little

分类：人工智能

2022-07-03

管理折扣促销活动（“ Markdown”）是经营电子商务业务的重要组成部分，这里的效率低下可能会严重阻碍零售商的盈利能力。解决此问题的传统方法在很大程度上取决于价格弹性建模。但是，价格弹性建模的部分信息性质，以及保护盈利能力的不可谈判的责任，意味着机器学习从业人员经常必须经过巨大的时间来定义衡量离线模型质量的策略。面对这一点，许多零售商依靠基于规则的方法，因此可以通过机器学习来捕获的盈利能力获得可观的收益。在本文中，我们介绍了两个新颖的端到端降价管理系统，以优化零售商旅程的不同阶段的赌注。第一个系统“ ITHAX”制定了无需估算的理性供应方定价策略，并且可以用作“冷启动”解决方案，以收集降价数据，同时保持收入控制。第二个系统“ Prosotheus”为价格弹性提供了一个完整的降价优化的框架。我们详细描述了特定的建模和验证程序，在我们的经验中，这对于建立在现实世界中稳健性能的系统至关重要。与我们经验丰富的运营团队在受控的在线测试中做出的决策相比，这两种降级系统都具有卓越的盈利能力，相对于手动策略，改善了86％（Promotheus）和79％（ITHAX）。这些系统已被部署以在ASOS.com上管理Markdown，并且可以在各种零售电子商务环境中进行价格优化的价格优化。

translated by 谷歌翻译

Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting

Benjamin Wilson , William Qi , Tanmay Agarwal , John Lambert , Jagjeet Singh , Siddhesh Khandelwal , Bowen Pan , Ratnesh Kumar , Andrew Hartnett , Jhony Kaesemodel Pontes

分类：计算机视觉 | 人工智能 | 机器学习 | 机器人

2023-01-02

We introduce Argoverse 2 (AV2) - a collection of three datasets for perception and forecasting research in the self-driving domain. The annotated Sensor Dataset contains 1,000 sequences of multimodal data, encompassing high-resolution imagery from seven ring cameras, and two stereo cameras in addition to lidar point clouds, and 6-DOF map-aligned pose. Sequences contain 3D cuboid annotations for 26 object categories, all of which are sufficiently-sampled to support training and evaluation of 3D perception models. The Lidar Dataset contains 20,000 sequences of unlabeled lidar point clouds and map-aligned pose. This dataset is the largest ever collection of lidar sensor data and supports self-supervised learning and the emerging task of point cloud forecasting. Finally, the Motion Forecasting Dataset contains 250,000 scenarios mined for interesting and challenging interactions between the autonomous vehicle and other actors in each local scene. Models are tasked with the prediction of future motion for "scored actors" in each scenario and are provided with track histories that capture object location, heading, velocity, and category. In all three datasets, each scenario contains its own HD Map with 3D lane and crosswalk geometry - sourced from data captured in six distinct cities. We believe these datasets will support new and existing machine learning research problems in ways that existing datasets do not. All datasets are released under the CC BY-NC-SA 4.0 license.

translated by 谷歌翻译

Almost Tight Error Bounds on Differentially Private Continual Counting

Monika Henzinger , Jalaj Upadhyay , Sarvagya Upadhyay

分类：机器学习

2022-11-09

The first large-scale deployment of private federated learning uses differentially private counting in the continual release model as a subroutine (Google AI blog titled "Federated Learning with Formal Differential Privacy Guarantees"). In this case, a concrete bound on the error is very relevant to reduce the privacy parameter. The standard mechanism for continual counting is the binary mechanism. We present a novel mechanism and show that its mean squared error is both asymptotically optimal and a factor 10 smaller than the error of the binary mechanism. We also show that the constants in our analysis are almost tight by giving non-asymptotic lower and upper bounds that differ only in the constants of lower-order terms. Our algorithm is a matrix mechanism for the counting matrix and takes constant time per release. We also use our explicit factorization of the counting matrix to give an upper bound on the excess risk of the private learning algorithm of Denisov et al. (NeurIPS 2022). Our lower bound for any continual counting mechanism is the first tight lower bound on continual counting under approximate differential privacy. It is achieved using a new lower bound on a certain factorization norm, denoted by $\gamma_F(\cdot)$, in terms of the singular values of the matrix. In particular, we show that for any complex matrix, $A \in \mathbb{C}^{m \times n}$, \[ \gamma_F(A) \geq \frac{1}{\sqrt{m}}\|A\|_1, \] where $\|\cdot \|$ denotes the Schatten-1 norm. We believe this technique will be useful in proving lower bounds for a larger class of linear queries. To illustrate the power of this technique, we show the first lower bound on the mean squared error for answering parity queries.

translated by 谷歌翻译

Deep Learning-Based Acoustic Mosquito Detection in Noisy Conditions Using Trainable Kernels and Augmentations

Devesh Khandelwal , Sean Campos , Shwetha Nagaraj , Fred Nugen , Alberto Todeschini

分类：机器学习

2022-07-28

在本文中，我们展示了一种独特的配方，可以通过将预处理技术融合到深度学习模型中来增强音频机学习方法的有效性。我们的解决方案通过通过训练而不是昂贵的随机搜索来优化超参数来加速培训和推理性能，从而从音频信号中构建可靠的蚊子探测器。此处介绍的实验和结果是MOS C提交ACM 2022挑战的一部分。在未发表的测试集上，我们的结果优于已发布的基线212％。我们认为，这是建立强大的生物声学系统的最好的现实世界中的一个例子之一，该系统在嘈杂的条件下提供可靠的蚊子检测。

translated by 谷歌翻译

Iterative Scene Graph Generation

Siddhesh Khandelwal , Leonid Sigal

分类：计算机视觉

2022-07-27

场景图生成的任务需要在给定图像（或视频）中识别对象实体及其相应的交互谓词。由于组合较大的解决方案空间，现有的场景图生成方法假设关节分布的某些分解以使估计可行（例如，假设对象在有条件地与谓词预测无关）。但是，在所有情况下，这种固定的分解并不是理想的（例如，对于相互作用中需要的对象很小且本身不可辨别的图像）。在这项工作中，我们建议使用马尔可夫随机字段中传递消息，提出一个针对场景图生成的新颖框架，并在图像上引入动态调节。这是作为迭代改进过程实现的，其中每个修改都在上一个迭代中生成的图上进行条件。跨改进步骤的这种条件允许对实体和关系进行联合推理。该框架是通过基于小说和端到端的可训练变压器建筑实现的。此外，建议的框架可以改善现有的方法性能。通过有关视觉基因组和动作基因组基准数据集的广泛实验，我们在场景图生成上显示了改善的性能。

translated by 谷歌翻译

On the Universality of Langevin Diffusion for Private Euclidean (Convex) Optimization

Arun Ganesh , Abhradeep Thakurta , Jalaj Upadhyay

分类：机器学习

2022-04-04

在本文中，我们重新审视了私人经验风险最小化（DP-erm）和差异私有随机凸优化（DP-SCO）的问题。我们表明，来自统计物理学（Langevin Exfusion（LD））的经过良好研究的连续时间算法同时为DP-SCO和DP-SCO提供了最佳的隐私/实用性权衡，$ \ epsilon $ -DP和$ $ \ epsilon $ -DP和$ （\ epsilon，\ delta）$ - dp均用于凸和强烈凸损失函数。我们为LD提供新的时间和尺寸独立统一稳定性，并使用我们为$ \ epsilon $ -DP提供相应的最佳超额人口风险保证。 $ \ epsilon $ -DP的DP-SCO保证的一个重要属性是，它们将非私人最佳界限匹配为$ \ epsilon \与\ infty $。在此过程中，我们提供了各种技术工具，这些工具可能引起独立的关注：i）在两个相邻数据集上运行损失功能时，一个新的r \'enyi Divergence绑定了LD，ii）最后一个过多的经验风险范围迭代LD，类似于Shamir和Zhang的嘈杂随机梯度下降（SGD）和iii）的LD，对LD进行了两期多余的风险分析，其中第一阶段是当扩散在任何合理意义上都没有在任何合理意义上融合到固定分布时，在第二阶段扩散已收敛到吉布斯分布的变体。我们的普遍性结果至关重要地依赖于LD的动力学。当它融合到固定分布时，我们获得了$ \ epsilon $ -DP的最佳界限。当它仅在很短的时间内运行$ \ propto 1/p $时，我们在$（\ epsilon，\ delta）$ -DP下获得最佳界限。在这里，$ p $是模型空间的维度。

translated by 谷歌翻译

Constant matters: Fine-grained Complexity of Differentially Private Continual Observation

Hendrik Fichtenberger , Monika Henzinger , Jalaj Upadhyay

分类：机器学习

2022-02-23

We study fine-grained error bounds for differentially private algorithms for counting under continual observation. Our main insight is that the matrix mechanism when using lower-triangular matrices can be used in the continual observation model. More specifically, we give an explicit factorization for the counting matrix $M_\mathsf{count}$ and upper bound the error explicitly. We also give a fine-grained analysis, specifying the exact constant in the upper bound. Our analysis is based on upper and lower bounds of the {\em completely bounded norm} (cb-norm) of $M_\mathsf{count}$. Along the way, we improve the best-known bound of 28 years by Mathias (SIAM Journal on Matrix Analysis and Applications, 1993) on the cb-norm of $M_\mathsf{count}$ for a large range of the dimension of $M_\mathsf{count}$. Furthermore, we are the first to give concrete error bounds for various problems under continual observation such as binary counting, maintaining a histogram, releasing an approximately cut-preserving synthetic graph, many graph-based statistics, and substring and episode counting. Finally, we note that our result can be used to get a fine-grained error bound for non-interactive local learning {and the first lower bounds on the additive error for $(\epsilon,\delta)$-differentially-private counting under continual observation.} Subsequent to this work, Henzinger et al. (SODA2023) showed that our factorization also achieves fine-grained mean-squared error.

translated by 谷歌翻译

Nonprehensile Manipulation of a Stick Using Impulsive Forces

Aakash Khandelwal , Nilay Kant , Ranjan Mukherjee

分类：机器人

2022-02-11

考虑了使用间歇性冲动力在三维空间中对棍子进行非骚扰操作的问题。目的是在一系列旋转对称的垂直轴对称的配置序列之间兼顾棍子。棍棒的动力学由五个广义坐标和三个控制输入描述。在应用冲动输入的两种连续配置之间，动力学在杂耍者的参考框架中以Poincar \'E映射为方便地表示。通过稳定庞加尔\'e地图上的固定点来实现与所需杂耍运动相关的轨道的稳定化。脉冲控制的Poincar \'e MAP方法用于稳定轨道，数值模拟用于证明与任意初始配置中所需的杂耍运动的收敛。在限制情况下，如果连续旋转对称配置被任意接近，则表明动力学将减少到箍上杆上稳定进动的动力学。

translated by 谷歌翻译

Simple but Effective: CLIP Embeddings for Embodied AI

Apoorv Khandelwal , Luca Weihs , Roozbeh Mottaghi , Aniruddha Kembhavi

分类：计算机视觉 | 机器学习

2021-11-18

对比语言图像预测（剪辑）编码器已被证明是有利于对分类和检测到标题和图像操纵的一系列视觉任务。我们调查剪辑视觉骨干网的有效性，以实现AI任务。我们构建令人难以置信的简单基线，名为Emplip，没有任务特定的架构，归纳偏差（如使用语义地图），培训期间的辅助任务，或深度映射 - 但我们发现我们的改进的基线在范围内表现得非常好任务和模拟器。 empclip将Robothor ObjectNav排行榜上面的20分的巨额边缘（成功率）。它使ithor 1相重新安排排行榜上面，击败了采用主动神经映射的下一个最佳提交，而且多于固定的严格度量（0.08至0.17）。它还击败了2021年栖息地对象挑战的获奖者，该挑战采用辅助任务，深度地图和人类示范以及2019年栖息地进程挑战的挑战。我们评估剪辑视觉表示在捕获有关输入观测的语义信息时的能力 - 用于导航沉重的体现任务的基元 - 并且发现剪辑的表示比想象成掠过的骨干更有效地编码这些基元。最后，我们扩展了我们的一个基线，产生了能够归零对象导航的代理，该导航可以导航到在训练期间未被用作目标的对象。

translated by 谷歌翻译

On the Application of Data-Driven Deep Neural Networks in Linear and Nonlinear Structural Dynamics

Nan Feng , Guodong Zhang , Kapil Khandelwal

分类：机器学习

2021-11-03

探讨了使用深神经网络（DNN）模型作为线性和非线性结构动力系统的代理。目标是开发基于DNN的代理，以预测给定输入（谐波）激发的结构响应，即位移和加速度。特别是，重点是使用完全连接，稀疏连接和卷积网络层的有效网络架构的开发，以及相应的培训策略，可以在目标数据用品中的整体网络复杂性和预测准确性之间提供平衡。对于线性动力学，网络层中重量矩阵的稀疏模式用于构建具有稀疏层的卷积DNN。对于非线性动力学，显示网络层中的稀疏性丢失，并探讨了具有完全连接和卷积网络层的高效DNN架构。还介绍了转移学习策略以成功培训所提出的DNN，研究了影响网络架构的各种装载因素。结果表明，所提出的DNN可以用作在谐波载荷下预测线性和非线性动态响应的有效和准确的代理。

translated by 谷歌翻译